Speect: a multilingual text-to-speech system
نویسنده
چکیده
This paper introduces a new multilingual text-to-speech system, which we call Speect (Speech synthesis with extensible architecture), aiming to address the shortcomings of using Festival as a research system and Flite as a deployment system in a multilingual development environment. Speect is implemented in C with a modular object oriented approach and a plugin architecture, aiming to separate the linguistic and acoustic dependencies from the run-time environment. A scripting language interface is provided for research and rapid development of new languages and voices. This paper discusses the motivation for a new text-to-speech system as well as the design architecture and implementation of the system. We also discuss what is still required in the development to make the new system a viable alternative to the Festival Flite tool-chain.
منابع مشابه
Introducing the Speect speech synthesis platform
We introduce a new open source speech synthesis engine and related set of tools: Speect is designed to be a portable and flexible synthesis engine, equally relevant as a research platform and runtime synthesis system in multilingual environments. In this paper we document our approach to the rapid development of British English voices for the 2010 Blizzard Challenge using this platform and reso...
متن کاملThe Speect text - to - speech system entry for the Blizzard Challenge 2013
This paper describes the Speect text-to-speech system entry for the Blizzard Challenge 2013. The techniques applied for the tasks of the challenge are described as well as the implementation details for the alignment of the audio books and the text-to-speech system modules. The results of the evaluations are given and discussed.
متن کاملMultilingual text-to-phoneme mapping
This paper introduces a novel approach for generating multilingual text-to-phoneme mappings for use in multilingual speech recognition systems. The multilingual mappings are based on the weighted outputs from a neural network text-to-phoneme model, trained on data mixed from several languages. The multilingual mappings used together with a branched grammar decoding scheme is able to capture bot...
متن کاملRecent Advances in Multilingual Text-to-speech Synthesis
In this paper we will discuss recent advances in multilingual text-to-speech (TTS) synthesis research at AT&T Bell Laboratories. The TTS system developed at AT&T Bell Laboratories generates synthetic speech by concatenating segments of natural speech. The architecture of the system is designed as a modular pipeline where each module handles one particular step in the process of converting text ...
متن کاملMultilingual text analysis for text-to-speech synthesis
We present a model of text analysis for text-to-speech (TTS) synthesis based on (weighted) finite-state transducers, which serves as the text-analysis module of the multilingual Bell Labs TTS system. The transducers are constructed using a lexical toolkit that allows declarative descriptions of lexicons, morphological rules, numeral-expansion rules, and phonological rules, inter alia. To date, ...
متن کامل